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TOLL-LIKE RECEPTOR BINDING PEPTIDES AND METHODS OF USE 

FIELD OF THE INVENTION 
This invention relates to novel peptides and methods of use thereof. In particular, 
5 there is provided novel toll-like receptor (TLR) binding peptide molecules useful in 
modulating inflammatory responses in immune-mediated diseases, ranging from 
autoimmunity to cancer to infectious diseases. More specifically, this invention relates to 
selecting heat shock proteins (HSP)-derived peptides capable of interfering with or 
modulating (e.g., inhibiting) proinflammatory responses resulting from and/or related to 
10 the interaction between HSPs and derivatives thereof and TLRs on immune cells. 

BACKGROUND OF THE INVENTION 
H» a * gWk proteins Toll I jke Receptors 

HSPs are highly conserved molecules that act as chaperones involved in the 
15 folding of newly synthesized proteins. In addition, they transport antigens and, after 
internalization, they mediate antigen-specific cytotoxic T cell, as well as Th cell- 
dependent, responses, During stress, synthesis of HSPs is up-regulated in prokaryotic, as 
well as in eukaryotic, ceUs. Moreover, mtracsllular redistribution of HSPs, and the 
expression of these molecules on the cell surface, has been reported. 
20 Recently, the stimulatory capacity of extracollular HSP60 and HSP70 on the 

innate immune system has been recognized. Human and murine macrophages respond to 
both bacterial and human HSP60 with the release of proinflammatory mediators such as 
TNF or IL-6 and of the Thl-promoting cytokines IL-12 and IL-15, suggesting that 
HSP60 might act as a "danger signal" for the innate immune system. The cell surface 
25 receptor for HSP60 has not yet been completely characterized. Increased local expression 
of HSP60, and presumably other HSPs, in target tissues has been found in various 
inflammatory diseases such as rheumatoid arthritis, insulitis, and atherosclerosis. It has 
been reported that HSP60 is an important target for immune responses during chronic 
inflammation or atherosclerosis T cell responses to autologous HSP60 therefore have 
30 been suggested to regulate inflammatory or auto-aggressive immune reactivities. 
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There is evidence that CD14 and TLR 2 and 4 are involved in HSP60-mediated 
cell activation. However, binding to macrophages appears to be TLR4-independent 
Recently, a report on HSP60-induced signaling pathways in macrophages has been 
published, suggesting that the stress-activated protein kinases c-Jun N-terminal kinase 
5 (INK) and p38 mitogen-activated protein kinase (MAPK), as well as I B kinase, may be 
activated after endocytosis of HSP60 by murine macrophage. These kinases induce 
transcription factors like NF-KB that are responsible for cytokine (e.g., TNF) gene 
transcription. 

Interestingly, HSP60 and 70 are also antigens for the adaptive immune system, 
10 and T cell responses to epitopes from these HSPs regulate inflammatory diseases like 
rheumatoid arthritis, insulin dependent diabetes mellitus, and artherosclerosis. 

Indeed, autologous HSP60 has been suggested to control the balance of T cell 
responses in inflammation and autoimmunity. Activation of T cells is induced through 
their interaction with antigen presenting cells (APCs), for example, dendritic cells (DCs). 
15 DCs reside in an immature form in non-lymphoid tissues where they efficiently capture 
foreign antigens. Upon activation by pathogens or by inflammatory stimuli like TNF or 
IL-1, DCs migrate to lymphoid organs where they potently activate antigen-specific T 
cell responses. During migration, DCs undergo functional and phenotypic changes 
termed as "maturation." For example, they up-regulate surface molecules like MHC 
20 class 1 and II molecules, CD86, CD40, and CD54, and, thus, become potent inducers of T 
cell activation. Maturation is further driven by CD40-CD40 ligand interaction upon 
contact of DCs with T cells. In parallel, DCs lose their capacity for endocytosis and de 
novo MHC class II synthesis during migration to the lymphoid tissues. This feature 
enables DCs to preserve the antigen taken up in the periphery for antigen-specific T cell 
25 activation. In addition, mature DCs are able to release large amounts of proinflammatory 
cytokines like TNF or IL-1 and of the Thl-promoting cytokines IL-12 and IL-18 but they 
also secrete counter regulatory DL-10. Therefore, DCs are not only unique in induction of 
naive T cell activation, but also play a decisive role in Th cell polarization. 

Turning now to TLR subject matter, recognition of pathogens is also mediated by 
30 a set of germ line-encoded receptors that are referred to as pattern-recognition receptors 
(PRRs). These receptors recognize conserved molecular patterns (pathogen-associated 
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molecular patterns), which are shared by large groups of microorganisms. We have 
discovered that TLRs function as the PRRs in mammals and play an essential role in the 
recognition of microbial components. The TLRs may also recognize endogenous ligands 
induced during the inflammatory response. Similar cytoplasmic domains may allow 
TLRs to use the same signaling molecules used by the interleukin 1 receptors (IL-IRs); 
these include MyD88, IL-lR-associated protein kinase and tumor necrosis factor 
receptor-activated factor 6. Evidence is accumulating that the signaling pathways 
associated with each TLR are not all identical and may, therefore, result in different 

biological responses. Thus, there has been no disclosure of using TLRs in a specific and 

deliberate manner to affect disease states in the medical field. 

Therefore, there is a need in the art for inventions such as that described herein, 

which provides a use for peptides derived from various peptides, e.g., HSP60, to interact 

with TLRs for the medical purpose of modulating immune response in patients. 

SUMMARY OF THE INVENTION 
In a first aspect, isolated TLR-binding peptides are provided. In certain preferred 
embodiments, the peptides of the invention comprise core TLR-binding motifs. In 
former related embodiments, these motifs comprise amino acid sequences of human and 
bacterial HSP (e.g., hsp60) proteins. The peptides of the invention provide isolated 
amino acid molecules comprising polypeptides having at least about 70% sequence 
identity, preferably at least about 80% sequence identity, more preferably at least about 
90% sequence identity, and most preferably at least about 95% sequence identity with a 
TLR-binding peptide that comprises an amino acid sequence according to any of the 
sequences of SEQ ID NOS:l-54. 

In a second aspect, the invention provides isolated nucleic acid molecules 
comprising a polynucleotide encoding a peptide having at least about 70% sequence 
identity, preferably at least about 80% sequence identity, more preferably at least about 
90% sequence identity, and most preferably at lent about 95% sequence identity with a 
TLR-binding peptide of the invention. Of course, the peptides of the invention can be 
) made by other techniques, including peptide synthetic chemistries (e.g., solid state 
peptide synthesis). 



In a third aspect, the invention provides for TLR-binding peptide mimetic 
molecules useful in modulating the natural activity of the TLR receptor in immune 
regulation. Relatedly, the peptides and mimetics of the invention can be incorporated 
into various compositions, including pharmaceutical compositions wherein a pept.de or 
peptide mimetic is combined with a pharmaceutically acceptable carrier. Such 
compositions can be liquid or dry, and can optionally include excipients and other 
molecules. 

In a fourth aspect, the TLR-binding peptides and mimetic molecules of the 
invention are useful in methods of modulating immune responses in immune-mediated 
, diseases. For example, the peptides and mimetic molecules of the invention can be used 
in immune modulation and thus can provide for enhancing innate immune responses, 
such as, for example, inducing or enhancing inflammatory responses. With respect to 
such embodiments, the enhancement can be used, for example, to provide for clearance 
of infectious agents, cancer cells, or to induce or enhance response to a vaccine. 
5 Alternatively,mepeptidescante^ 

TLR In this regard, the peptides and mimetic molecules of the invention can be used to 
bring about down regulation of an immune pathway. For example, down regulation can 
comprise down regulation in autoimmunity such as in Rheumatoid Arthritis (RA), 
Juvenile Idiopathic Arthritis (J1A), Psoratic Arthiritis (PA), Osteoarthritis (OA), 
20 InflammatoryBowelDisease^ 
(DM)- 

The peptides and peptide mimetics of the invention are also contemplated to be 
used to treat cancerous conditions such as melanoma, leukemia, lymphoma, solid 
tumours (lung, liver, kidney, brain, bladder), retinoblastoma, sarcomas and other 

25 connective tissue cancers, and the like. The molecules of the invention are also 

contemplatedfor use in treating immune-mediated pamogemc infections includmg such 
inunune mediated microbial infections as tuberculosis, leprosis, bacterial infections of 
Grampositive and Gram negative microroganisn., HTV/AIDS, Epstem BarrVirus and 
Cytomegalovirus infections, and protozoan infections, such as Leishmania, and the hke. 

30 Use of the peptides and mimetics of the invention are also contemplated in Insulin- 
Dependent Diabetes Mellitus (IDDM), Systemic Lupus Erythematosus (SLE), Sjogren's 
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Syndrome, Scleroderma, Polymyositis, Chronic Active Hepatitis, Mixed Connective 
Tissue Disease, Primary Biliary Cirrhosis, Pernicious Anemia, Autoimmune Thyroiditis, 
Idiopathic Addison's Disease, Vitiligo, Gluten-Sensitive Enteropathy, Graves' Disease, 
Myasthenia Gravis, Autoimmune Neutropenia, Idiopathic Thrombocytopenia Purpura, 
cirrhosis, pemphigus vulgaris, autoimmune infertility, Goodpasture's Disease, Bullous 
Pemphigoid, Discoid Lupus, Ulcerative Colitis, and Dense Deposit Disease). The 
diseases set forth above, as referred to herein, include those exhibited by animal models 
for such diseases, such as, for example Non-Obese Diabetic (NOD) mice for IDDM and 
Experimental Autoimmune Encephalomyelitis (EAE) mice for multiple sclerosis. 

In a fifth aspect, the peptides and mimetics of the invention can be used to 
modulate specific immune responses such as a TH-1 response (i.e., a stimulating 
response wherein proinflammatory cytokines are induced, such as TNFa and INF Y , IL-6, 
IL-2, IL-12), or the T cells that have a regulatory function are induced to bring about 
either a TH2 or TH3 response (i.e., a regulatory response wherein anti-inflammatory 
15 cytokines are induced, such as TGFp, IL-4, and IL-10). 

In a sixth aspect, the peptides and mimetics of the invention in some form affect, 
or alternatively are associated with, activities, or lack thereof, of cytokines and other 
immune mediators, including the following: 

(tumor necrosis factor alpha); LT (lympotoxin); IL-6 
(interleukin6); GM-CSF (granulocyte macrophage colony stimulating factor); M-CSF 
(maciophage colony stimulating factor); LIF (leucocyte inhibitory factor); Oncostatin M 
(oncostatin M); IL-2 (interleukin 2); IL-3 (interleukin 3); IL-7 (interleukin 7); IL-9 
(interleukin 9); IL-12 (interleukinl2); IL-15 (interleukin 1 5); IFNa/p (interferon 
alplia/beta); lFN-y (interferon gamma); IL-17 (interleukin 17); IL-18 (interleukin 18). 
Immunoregulatory responses 

[L^talerteukin 4); IL-10 (inlerleukin 10);IL-11 (interleukin 11); IL-13 (interleuta 13); 
TGF-P (transforming growth factor beta). 
Chemokines responses 

IL-8 (interleukin 8); Groa (melanoma growth stimulating activity); MIP-1 (macrophage 
inflammatory protein); MCP-1 (monocyte chemoattractant protein); ENA-78 (epithehal 
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neutt opMI activating peptide 78); RANTES (regulated upon activatioin T cell expressed 

& secreted). 
Mitogens 

FOF (Fibroblast grow* faCor); PDOF (Platelet-derived growth factor); VEGF (vascular 
5 endothelial growth factor). 

In a related aspect, such cytokines and immune mediators can be blocked by 
TLR-binding peptides of the invention. 

In still another aspect, the invention also provides a novel methodology for 
1 o determining likely TLR-binding peptide motifs by performing in silica computer 

modeling comprising methods based on, among other factors, the sequence and structure 

of the peptides of the invention. 

In yet further embodiments, the peptides of the invention are contemplated to be 
used in diagnostic and in therapeutic applications for patients with autoimmune diseases 
1 5 including cancers and infectious diseases. 

BRIEF DESCRIPTION OF THE DRAWINGS 
Figure 1 shows an Antigenic Index graph that shows regions of the protein 
predicted to be exposed on the protein surface plotted above the line and unexposed 

20 regions below the line. 

Figure 2 shows a Kyte/Doolittle hydrophilicity graph showing average 
hydrophiUcity values above the axis and hydrophobic regions below the line. 

Figure 3 shows Surface Probability wherein plots above the line approaching a 
value oflrepresentsahexapeptide centered at such point is predicted to have exposure at 

25 the surface of the protein. 

Figure 4 shows a Chou-Fasman chart indicating regions of helix vs. sheets and 

turns. 

Figure 5 shows an Antigenic Index which plot is smoothed by the refinement of 
the invention. 

30 Figme 6 shows a Hydrophilicity plot which plot is smoothed by the refinement of 

the invention. 
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Figure 7 shows a Surface Probability plot which plot is smoothed by the 

refinement of the invention. 

Figure 8 shows a Chou-Fasman plot which plot is smoothed by the refinement of 

the invention. 

5 Figur e9i S alh re e-din>emional S tickmo<)elofHSP60ftom£.co«. Apreteted 

surf** region is noted k yellow corresponding fte region 426^44 of the ft coli HSP60 
protein, one of the selected peptides. 

Figures 10A-D are a series of graphs showing the effect of different peptides on 
expression of TLR2, TLR4 receptor, and secretion of TNF-alpha and IL-6 
10 proinflammatory cytokines, on monocyte-derived dendritic cells. 

Figure 11 is a graph showing the effect of different peptides on TNF-a secretion 

from a mouse macrophage cell line, RAW264.7. Cell preparation and stimulation for 
TNF-a production were seeded into flat-bottomed 96-well plates (0.1 x 10 cells/200ml 
well). After 18 hrs of incubation, cells were stimulated with the different peptides and, as 
15 a control LPS was used. Supernatants were harvested respectively at 6, 24, 48, and 72 
hours EL1SA assay was used and the amount of TNF-alpha in culture supernatants was 
quantified using a Mouse TNF-a Enzyme-Linked Immunosorbent Assay. TNF-a values 
were extrapolated using a standard curve. Results are expressed in picograms TNF-«/ml. 
Values shown were calculated by subtracting for each peptide the value of unstimulated 

20 cells. LPS data not shown. 

Figure 12 is a diagram showing the general interconnection of cytokine 
stimulation/expression between innate and adaptive immunity pathway. 

Figure 1 3 is a flow chart ouflining the experimental pathway for Examples 2 and 

3 below 

Figure 14 is a graph showing TNF-a production in dendritic cells using real time 



25 

PCR. 



Figures 15A-B are graphs showing percentage of CD8+ cells and TNF-a 
production in dendritic cells (CD83+, HLA-DR+) measured with staining. 
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DETAILED DESCRIPTION OF THE INVENTION 
In a first aspect, peptides comprising core TLR-binding motifs are provided. 
Such motifs may be found in various heat shock proteins, including hsp-60, hsp-70, and 
dnaJ proteins derived from organisms including E. coli (e.g., strain K12), Mycobacterium 
5 tuberculosis, and humans. 



Table 1 



Peptide source * 
Human hsp 60 


\mino j 

r\cid 

Position 

156-162 

1S4-168 

230-241 

300-320 

302-314 

360-375 


\mino Acid Seq. 

KKOSKPVTT 
Fl VKQSKPVTTPEEI 

NTSKGQKCEFQD . .- 

VKAPGFGDNRKNOLKDMAIAT 

APGFGDNRKNQLK 

GKGDKAOIEKRIQEII 


Peptide 
reference 

number 
P22 
P27 
P28 
P3 
P29 
P23 


Seq. 
Id. 

No. 
22 
27 
28 
2 

29 
23 
30 


E. co//hsp 60 


360-376 

380- 400 

381- 398 

382- 396 
547-558 
195-211 

345- 375 

346- 372 
348-370 

387- 396 

388- 396 
426-444 
429-439 


GKGDKAOIEKRIQE11E 
VTTSEYEKEKLNERLAKLSDG 
TTSEYEKEKLNERLAKLS 
TSEYEKEKLNERLAK 
TE1PKEEKDPGM 
FDRGYLSPYFINKPETG 
nvAniWOOTEEATSDYDREKLOERVAKLAGG 

VAOIROOIEEATSDYDREKLQERVAKL 

OIROQIEE ATSDYDREKLOER V A 

VEMKEKKARV 

EMKEKKAR 
LADLRGQNEDONVGIKVAL 

LRGONEDONVG 


P30 
P4 
P31 
P24 
P32 
P33 
P2 
P34 
P35 
P36 
P25 
PI 
P37 
P38 


4 

31 
24 
32 
33 
6 
34 
35 
36 
25 
8 

XI 

J 1 1 

38 


E. coli hsp 70 

Mycobacteria 
hsp 60 


95- 110 

96- 117 
159-172 
249-268 
253-266 
583-596 
59-68 

133-143 
135-143 
204-213 

342- 370 

343- 370 
346-367 
346-368 
383-402 


DIOORYPHLPYOFOAS 

IOORYPHLPYOFOASENGLPM1 
YFDDAQROGTKDAA 
IREOAGIPDRSDNRVORELL 

AfirPDRSDNRVORE 
OAIKNVDKOTODFA 
IELEDPYEKI 

GAKEVETKEQI 

KEVETKEQ 
TDPEROEAVL 

GRVAOIRQEIENSDSDYDREKLgtKLAKL 

" RVAQlRQFTRNSnSDYDREKLQERLAKL 

QIRQEIENSDSDYDREICLOERL 
QTROEIENSDSDYDREKLOERLA 
TEVELKERKHRIEDAVRNAG 


P7 
P39 
P8 
P40 
P41 
P42 

P43 
P26 
P44 

P5 
P45 
P12 
P46 

P6 


1 

39 
3 

40 
41 
42 

43 
26 
44 
10 
45 
11 
46 
12 
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£c „,mSP60pepMe sequence 34 5 .375(P2)^M^^«P M 
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(R.345) has been substituted with glycine (G.345), and with respect to P6, lysine 402 
(K.402) has been substituted with glycine (G.402). 

5 Sequencing algorithms can be used to measure homology or identity between 

known and unknown sequences. Such methods and algorithms are useful in identifying 
corresponding sequences present in other organisms as well as in the design of peptides 
of the invention. Homology or identity is often measured using sequence analysis 
software (e.g., Sequence Analysis Software Package of the Genetics Computer Group, 
10 University of Wisconsin Biotechnology Center, 1710 University Avenue, Madison, WI 
53705). Such software matches similar sequences by assigning degrees of homology to 
various deletions, substitutions and other modifications. The terms "homology" and 
"identity" in the context of two or more nucleic acids, polypeptide, or peptide sequences, 
refer to two or more sequences or subsequences that are the same or have a specified 
15 percentage of amino acid residues or nucleotides that are the same when compared and 
aligned for maximum correspondence over a comparison window or designated region as 
measured using any number of sequence comparison algorithms or by manual alignment 
and visual inspection. 

20 For sequence comparison, typically one sequence acts as a reference sequence, to 

which test sequences are compared. When using a sequence comparison algorithm, test 
and reference sequences are entered into a computer, subsequence coordinates are 
designated, if necessary, and sequence algorithm program parameters are designated. 
Default program parameters can be used, or alternative parameters can be designated. 

25 The sequence comparison algorithm then calculates the percent sequence identities for 
the test sequences relative to the reference sequence, based on the program parameters. 

A "comparison window," as used herein, includes reference to a segment of any 
one of the number of contiguous positions selected from the group consisting of from 8 to 
30 10, 10 to 20, 20 to 600, usually about 50 to about 200, more usually about 100 to about 
150 in which a sequence may be compared to a reference sequence of the same number 
of contiguous positions after the two sequences are optimally aligned. Methods of 
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alignment of sequence for comparison are well-known in the art. Optimal alignment of 
sequences for comparison can be conducted, e.g., by the local homology algorithm of 
Smith & Waterman, Adv. Appl. Math. 2:482 (1981), by the homology alignment 
algorithm of Needleman & Wunsch, J. Mol. Biol 48:443 (1970), by the search for 
5 similarity method of person & Lipman, Proc. Natl Acad. Set USA 85:2444 (1988), by 
computerized implementations of these algorithms (GAP, BESTFIT, FASTA, and 
TFASTA in the Wisconsin Genetics Software Package, Genetics Computer Group, 575 
Science Dr., Madison, WI), or by manual alignment and visual inspection. 

10 One example of a useful algorithm is BLAST and BLAST 2.0 algorithms, which 

are described in Altschul et al. , Nuc. Acids Res. 25:3389-3402 (1977) and Altschul et al. , 
J. Mol. Biol. 215:403-410 (1990), respectively. Software for performing BLAST 
analyses is publicly available through the National Center for Biotechnology Information 
(http://www.ncbi.nlm.nih.gov/). This algorithm involves first identifying high scoring 

1 5 sequence pairs (HSPs) by identifying short words of length W in the query sequence, 
which either match or satisfy some positive-valued threshold score T when aligned with a 
word of the same length in a database sequence. T is referred to as the neighborhood 
word score threshold (Altschul et al. , supra). These initial neighborhood word hits act as 
seeds for initiating searches to find longer HSPs containing them. The word hits are 

20 extended in both directions along each sequence for as far as the cumulative alignment 
score can be increased. Cumulative scores are calculated using, for nucleotide sequences, 
the parameters M (reward score for a pair of matching residues; always >0). For amino 
acid sequences, a scoring matrix is used to calculate the cumulative score. Extension of 
the word hits in each direction are halted when: the cumulative alignment score falls off 

25 by the quantity X from its maximum achieved value; the cumulative score goes to zero or 
below, due to the accumulation of one or more negative-scoring residue alignments; or 
the end of either sequence is reached. The BLAST algorithm parameters W, T, and X 
determine the sensitivity and speed of the alignment The BLASTN program (for 
nucleotide sequences) uses as defaults a word length (W) or 1 1 , an expectation (E) or 10, 

30 M=5, N=-4 and a comparison of both strands. For amino acid sequences, the BLASTP 
program uses as defaults a word length of 3, and expectations (E) of 10, and the 



11 



BLOSUM62 scoring matrix (see Henikoff & Henikoff, Proc. Natl. Acad. Sci. USA 
89:10915 (1989)) alignments (B) of 50, expectation (E) of 10, M=5, N=-4, and a 
comparison of both strands. 

5 The BLAST algorithm also performs a statistical analysis of the similarity 

between two sequences (see, e.g., Karlin & Altschul, Proc. Natl. Acad. Sci. USA 90:5873 
(1993)). One measure of similarity provided by BLAST algorithm is the smallest sum 
probability (P(N)), which provides an indication of the probability by which a match 
between two nucleotide or amino acid sequences would occur by chance. For example, a 

10 nucleic acid is considered similar to a references sequence if the smallest sum probability 
in a comparison of the test nucleic acid to the reference nucleic acid is less than about 
0.2, more preferably less than about 0.01, and most preferably less than about 0.001. 

Other algorithms for determining homology or identity include, for example, in 
15 addition to a BLAST program (Basic Local Alignment Search Tool at the National 
Center for Biological Information), ALIGN, AMAS (Analysis of Multiply Aligned 
Sequences), AMPS (Protein Multiple Sequence Alignment), ASSET (Aligned Segment 
Statistical Evaluation Tool), BANDS, BESTSCOR, BIOSCAN (Biological Sequence 
Comparative Analysis Node), BLIMPS (BLocks IMProved Searcher), FASTA, Intervals 
20 & Points, BMB, CLUSTAL V, CLUSTAL W, CONSENSUS, LCONSENSUS, 

WCONSENSUS, Smith-Waterman algorithm, DARWIN, Las Vegas algorithm, FNAT 
(Forced Nucleotide Alignment Tool), Framealign, Framesearch, DYNAMIC, FILTER, 
FSAP (Fristensky Sequence Analysis Package), GAP (Global Alignment Program), 
GENAL, GIBBS, GenQuest, ISSC (Sensitive Sequence Comparison), LALIGN (Local 
25 Sequence Alignment), LCP (Local Content Program), MACAW (Multiple Alignment 
Construction & Analysis Workbench), MAP (Multiple Alignment Program), MBLKP, 
MBLKN, PIMA (Pattern-Induced Multi-sequence Alignment), SAGA (Sequence 
Alignment by Genetic Algorithm) and WHAT-IF. Such alignment programs can also be 
used to screen genome databases to identify polynucleotide sequences having 
30 substantially identical sequences. A number of genome databases are available, for 
example, a substantial portion of the human genome is available as part of the Human 
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Genome Sequencing Project (J. Roach, http://weber.u. Washington.edu/~roaclVhunian_ 

genome, progress2.html) (Gibbs, 1995). At least twenty-one other genomes have 

already been sequenced, including, for example, M. genitalium (Fraser et aL, 1995), M 

jamwchii (Bultef a/., 1996), H. influenzae (Fleischmann etal, 1995), E. coli (Blattner 

5 et aL, 1997), and yeast (S. cerevisiae) (Mewes et aL, 1997), and D. melanogaster (Adams 

et aL, 2000). Significant progress has also been made in sequencing the genomes of 

model organism, such as mouse, C. elegans, Arabadopsis sp. and D. melanogaster. 

Several databases containing genomic information annotated with some functional 

information are maintained by different organization, and are accessible via the internet, 

10 for example, http://wwwtigr.org/tdb; http://www.genetics.wisc.edu; http://genome- 

www.stanford.edu/~ball; http://hiv-web.lanl.gov; http://www.ncbi.nlm.nih.gov; 

http://www.ebi.ac.uk; http://Pasteur.fr/other/biology; and http:/www.genome.wi.mit.edu. 

Further, selection of HSP-derived peptides may be carried out by, such as, the 

following protocol wherein HSPs from various species are obtained using the Entrez 

1 5 retrieval system (http://www.ncbi.nlm.nih.gov/Entrez/) at the National Center for 

Biotechnology Information (NCBI) flittp-//www.ncbi.nlm.mh.gov/). The Entrez retrieval 

system allows for searching several linked publicly available databases and it provides 

access to the date-bases listed below. Any of these data bases may be employed in the 

methods of the invention, such as any of the following: 

20 Databases 

-PubMed: biomedical literature 
-Nucleotide: sequence database (GenBank) 
-Protein: sequence database 

<http://www.ncbi.nlm.nih.gov/entrez/query .fcgi?db=Protein> 
25 -Structure: three-dimensional macromolecular structures 

-Genome: complete genome assemblies 

-Books: BookShelf online books 

-Domains: conserved domains (CDD) 

-3D Domains: domains from Entrez Structure 
30 -GEO: Gene Expression Omnibus 

-GEO Datasets: curated GEO data sets 

•Journals: journals in Entrez 

-MeSH: medical subject headings 

-NCBI Web Site: NCBI Web site search 
35 -OMTM: Online Mendelian Inheritance in Man 

-PMC: full-text digital archive of life sciences journal literature 
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-PopSet: population study datasets 
-SNP: single nucleotide polymorphisms 
-Taxonomy: organisms in GenBank 
-UniGene: gene-oriented clusters of transcript sequences 
5 -UniSTS: markers and mapping data 

Protein sequences are obtained such as the list of the retrieved HSP amino acid sequences 
from various species used for selecting the peptides, such as for example: 
Escherichia coli 

1 0 -hsp60: DEFINITION: GroEL, chaperone Hsp60, peptide-dependent ATPase, heat 
shockprotein [Escherichia coli K12]. VERSION: NP_418567.1 GI:16131968 
-hsp70: DEFINITION: heat shock protein, chaperone, member of Hsp70 protein family 
[Escherichia coli K12]. VERSION: NP_417021.1 GI:16130451 
-dnaj: DEFINITION: Chaperone protein dnaJ (Heat shock protein J) (HSP40). 

1 5 VERSION: P08622 GI:1 18719 
Homo sapiens 

-hsp60: DEFINITION: heat shock 60kDa protein 1 (chaperonin); heat shock 60kD 
protein l(chaperonin) [Homo sapiens]. VERSION: NP_002147.1 GI:4504521 
-hsp70: DEFINITION: Heat shock 70 kDa protein 1 (HSP70.1) (HSP70-1/HSP70-2). 
20 VERSION: P08107 GI:462325 

-dnaj: DEFINITION: DnaJ homolog subfamily B member 1 (Heat shock 40 kDa protein 
1) (Heat shock protein 40) (HSP40) (DnaJ protein homolog 1) (HDJ-1). VERSION: 
P25685 GI: 1706473 

Mycobacterium tuberculosis 
25 -hsp60: DEFINITION: heat shock 60kDa protein 1 (chaperonin); heat shock 60kD 
protein l(chaperonin) [Homo sapiens]. VERSION: NP_002147.1 GI:4504521 
-hsp70: DEFINITION: Heat shock 70 kDa protein 1 (HSP70.1) (HSP70-1/HSP70-2). 
VERSION: P08107 GI.462325 

Following retrieval of the sequences of interest, the retrieved sequences were 
30 imported or transferred to Mac Vector 7. 1 . 1 Trial Version to analyze the primary 

structure of these proteins using algorithms contained in this software package, able to 
originate secondary structure predictions based on the analysis of the whole protein 
primary sequence. Several tools were then used including the following: 
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-Antigenic Index: This index uses information that goes beyond single type 
analysis and the algorithm combines information from hydrophylicity, surface probability 
and backbone flexibility predictions along with the secondary structure predictions of 
other algorithms such Chou-Fasman and Gobson-Garnier. profiles originated using this 
5 algorithm identify regions of a given amino acid sequence that is predicted to be exposed 
to the surface and therefore that may be antigen sites. This analysis combinations is able 
to produce a composite prediction of the surface contour of a given protein. Regions of 
the protein predicted to be exposed at the protein's surface are plotted above the 
graph axis and vice versa non exposed regions are plotted below the graph axis (See 
10 Figs). 

-Kite-Doolittle Hydrophilicity Profile: This algorithm profiles graphs of the 
local hydrophilicity of a protein along its aminoacid primary sequence. The rationale is 
based on the assignment of a hydropathy values to each of the 20 amino acids based on 
some experimental of empirical measure. Then a window of size N run along the length 

15 of the protein and each hydropathy value is summed and then divided by N to obtain an 
average hydrophilicity for the window. Average hydrophilicity values are then plotted to 
assign values above the axis denote hydrophilic regions and potentially exposed to the 
surface of the protein. Conversely, values below the axis indicate hydrophobic regions 
that tend to be buried inside the molecule or inside hydrophobic environments such as 

20 membranes. The Kite-Doolittle scale is the most commonly used hydrophathy scale and 
its values are assigned using a combination of water-vapor transfer free energy for amino 
acid side chains and the preference of amino acid side chains for interior or exterior 
environments, with small adjustments made to the final values based on experimental 
experience. 

25 -Surface Probability: This algorithm predicts which regions of a protein are most 

likely to lie on the protein surface, based on the knowledge of which amino acids are 
more likely to be found on the protein surface of proteins of known structure. The 
algorithm employs a determination of solvent-accessible surface area of each residue. 
Residues were then classified as "buried" if their accessible surface are was smaller than 

30 20 angstroms and "exposed" if the accessible surface area was larger than 60 angstroms. 
For each of the 20 amino acids information was compiled on what percent of the time the 
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amino acids were found to be exposed or buried in a sample of proteins of known atomic 
structure. In order to calculate the fractional surface probability for each amino acid, 
Mac Vector sums the six fractional probabilities of the amino acids in the window and 
divides by six fractional probabilities of the amino acids in the selected window and 
5 divides by six to yield a running average of the fractional surface probability along the 
length of the protein. Any value close to LO at any point in the sequence of the protein 
would mean that the hexapeptide centered about that point is predicted to be exposed at 
the surface. Conversely, values close to 0.0 will mean that the hexapeptide is buried in 
the interior of the protein.(32,33). 

1 0 -Chou-Fasman: This is a widely used method of secondary structure prediction 

based on known X-ray structures. The authors compiled statistics on the tendency of an 
amino acid to appear in a given secondary structure and used these statistics to assign the 
20 amino acids into four classes: helix formers, helix breakers, sheet formers and sheet 
breakers. The algorithm then predicts the structure of proteins by locating clusters of 

15 helix- or sheet-forming residues in the amino acid sequence and applying a set or rules to 
determine if these clusters are significant enough to nucleate a helix or a beta sheet 
structure. Using the MacVector version, the prediction or conformation is treated 
independently of the others and each structure prediction is graphed separately. 

Once the above analysis is performed, the values assigned by the three above 

20 mentioned algorithms and originated score values depending of the algorithm were 

visualized in the form of a Data Summary Table (DST) or in the form of a Data Plot (DP) 
as provided in the following examples below. Additionally, in analyzing DST and DP we 
selected regions of the proteins containing structures denominated a-helix and 0-turns (or 
loops) that are characterized by appearing on the surface of proteins and usually reversing 

25 the direction of a chain. 0-turns refer to a section of a polypeptide connecting regions of 
defined secondary structure. In many proteins loops contain functional residues in that 
they tend to be more flexible in conformational changes than helices and sheets and 
therefore tend to be relevant part of binding sites. Also, in case we had a certain "core 
putative TLR-binding region" normally a 0-turns predicted to be hydrophilic and surface- 

30 exposed we selected flanking regions on both sides predicted to be hydrophobic or not- 
surface-exposed but still predicted to be a-helix. In case such prediction was not 
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satisfactory we made substitutions of hydrophilic amino acid with other amino acids of 
the appropriate characteristic. 

EXAMPLE I: Selection of Peptide: eHSP60pl 

5 A Data Summary Table was produced for eHSP60-Pl . The primary amino acid 

sequence of E. coli HSP60 was analyzed using the combination of four different 
algorithms described above. The result of this analysis was displayed in a Data Summary 
Table I that shows region 426-444 of the primary amino acid structure of E. coli HSP60. 
Of the entire sequence analysis we selected regions that had a "Helix-Turn-Helix" 

1 0 predicted structure. In italics is highlighted the p-turn region. In plain print the flanking 
regions having a predicted a-helix structure. 



Data Summary Table I 
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35 Next, a Data Plot was generated wherein we analyzed the same region of the K 

coli HSP60 protein sequence originated by the same computer analysis done using a 
combination of the of four different algorithms described above (See Figs. 1-4). This 
analysis confirmed a "core sequence" including the amino acids 429-438 that was 
predicted to be surface exposed by the four different algorithms and matched with the 

40 "Helix-Turn-Helix" predicted structure by the Chou-Fasman analysis. 

The combination of Summary Table and Data Plot provided for prediction and 
identification of the regions of the protein that have the highest surface exposure 



17 



prediction. For the selection of the peptides predicted to play a role in interacting with 
TLR on monocytes and monocytes derived DC we selected regions of the proteins which 
had high scores for all the three selected algorithms, i.e. -Antigenic Index; -Kite-Doolittle 
Hydrophilicity; -Surfece Probability for the "core region". Flanking regions had less 
5 surface exposure prediction score values and that satisfied the rule of "Helix-Turn-Helix" 
structure. 

Additionally, we "refined" the selection found by the above analysis by re- 
analyzing the selected region, i.e. peptide, alone using the same algorithms and 
comparing these predictions with the same region within the original protein. If changes 

10 occurred in the score values we made changes of the amino acids of the flanking ends to 
make sure that the two predictions were as close as possible and maintained or 
acquired the "Helix-Turn-Helix" structure. This refinement produced the affect on 
Antigenic Index, hydrophilicity, Surface Probability, and Chou-Fasman as shown in 
Figures 5-8 wherein region 426-444 of the primary amino acid structue of E. coli HSP60 

1 5 is reanalyzed using the combination of four different algorithms described above. 

As an example of such refinement, the invention peptides include derivative 
peptides of wild-type peptides such as P2 and P6 disclosed in Table I. These single 
amino acid changes were brought about because in the computer analysis these 
substitutions generated a "predicted-structure" more likely to be surface-exposed. 

20 Further refinement of the peptide selection was carried out (based on computer 

models if available). For example, a three dimensional computer model of £ coli HSP60 
protein was used to examine the position of the PI peptide. Selected regions were 
originated using as source the structural coordinates of the crystallized K coli protein 
Groel (otherwise called HSP60) and visualized using Cn3D4. This software is a 

25 visualization tool for biomolecular structures, sequences, and sequence alignments and is 
publicly available at: <http://www.ncbi.nlm.iuh.goW^ 1° 
yellow is highlighted the region 426-444 of the E. coli HSP60 protein. It can be noted 
that the region 426-444, as predicted, is surface exposed (See Figure 9). 

30 EXAMPLE 2: Effect of HSP-derived peptides on immune modulation 
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HSP-derived peptides that bind TLR receptors can affect expression of TLR2, 
TLR4 receptor and secretion of TNF-alpha and IL-6 proinflammatory cytokines, on 
monocyte-derived dendritic cells. Here, we show that human heat shock protein 60 
(hHSP60) elicits a potent pro-inflammatory response in cells of the innate immune 

5 system and therefore has been proposed as a danger signal of stressed or damaged 
cells. Previous work showed that mouse or human macrophages, as well as endothelial 
or smooth muscle cells, were found to elicit a pro-inflammatory response when incubated 
with recombinant hHSP60. The response included the up-regulation of adhesion 
molecule expression and the release of inflammatory mediators such as IL-6 and TNF- 

1 0 alpha. Since autologous hsp60 may be aberrantly expressed on the cell surface in 

response to stress and will be set free from the cell interior during necrosis, these findings 
point to a role of hHSP60 in initiating or sustaining Thl- dependent tissue inflammation. 
Interestingly, microbial HSP60/65 also induces a pro-inflammatory response in innate 
immune cells suggesting that damaged autologous cells and microbial pathogens may 

1 5 alert innate immunity via the same recognition system. TLR4 receptor has been 
identified to mediate HSP60 signaling while TLR2 appears to be important in LPS 
binding and signaling. We decided to investigate if peptides derived from 
phylogenetically separate hsp60 species would interact with innate immune cells via 
different recognition pathways as Tlr2 and Tlr4 As shown in Figure 10A-D, monocyte- 

20 derived dendritic cells respond to the different peptides with a kinetics that revealed peak . 
levels of expression at 6 hours after stimulation. PI, P3, and P4 were able to induce, at 
various levels, expression of TLR2 and TLR4 receptor. Also TNF-alpha and IL-6 
cytokine secretion was measured with PI, P3 and P4. These data indicate correlation 
between receptor expression and pro-inflammatory cytokines secretion. TNF-alpha and 

25 IL-6 production was induced with P2 at 6 hrs (Fig. 10C-D) but very low level of 

Tlr2 and Tlr4 receptor expression was detected (Fig. 10A-B). This result may be due to a 
different activation pathway by the P2 peptide compare to the previous peptides. 
Interestingly, PI was affecting TLR2 and TLR4 expression at 48 hrs and not at 24 hrs 
(Fig. 10A-B), while P4 was giving TLR4 expression up to 24 hrs after stimulation (Fig. 

30 10B) but no cytokine secretion was measurable. As a positive control, HSP60 was used 
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for all parameters analyzed (Figs. 10A-D) while LPS was used for TNF-alpha and IL-6 
induction. 

Materials and Methods 

5 Monocytes and DC preparation 

PBMC were derived by Ficoll-Hypaque density gradient centrifugation from 
buffy coats of healthy blood donors obtained from the San Diego Blood Bank. CD14- 
positive monocytes were enriched to 93% purity by Percoll density gradient 
centrifugation. DC were generated by previously isolated monocytes in IMDM medium 

10 supplemented with 1% human AB serum, GM-CSF and IL-4 for 6 days, replacing 50% 
of the culture medium every other day. 

Induction ofTLRZ TLR4 and cytokine release from monocvte-derived dendritic cells 

DC (1x106 cells/well) were stimulated with the different peptides and as a control 

HSP and LPS were used. Cells were harvested respectively at 6, 24, 48, and 72 hours. 
15 Flow cytometry assay. 

Specific cell surface staining was performed using FITC, PE and Cy-conjugated 

antibody (Ab) in saturating amounts. Specific isotype controls were used for each 

sample. Anti-TLR-2, anti-TLR-4, anti-CD86, anti-CD83, anti-HLA-DR, anti-TNF-a and 

anti-IL-6 Abs were used. Results were expressed as the percent of positive cells for a 
20 given marker. The relative percent of fluorescence intensities was calculated as the value 

of the percent of fluorescence intensities of the respective marker Ab minus the specific 

isotype control and the values obtained with non-stimulated cells. 

EXAMPLE 3: Effect of HSP-derived peptides on secretion of TNF-alpha from a mouse 

25 macrophage cell line, RAW264.7. 

As discussed above, we wanted to investigate if peptides derived from 
phylogenetically separate hsp60/70 species would interact and elicit an inflammatory 
response in a mouse monocyte/macrophage cell line. As shown in Figure 11, RAW264.7 
cells respond to the different peptides with a kinetics that revealed peak levels of 

30 expression at 72 hours after stimulation. PI 0 (hsp70 human derived peptide) and P3, P4 
(hsp60 human derived peptide) are inducing a very strong TNF-a secretion. Also, PI and 
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P2 (HSP60 £ co/i-derived peptide) and P5, P6 (hsp60 Mycobacteria-derived peptide) are 
inducing a very comparable TNF-a secretion. We conclude that phylogenetically 
separate hsp60 / 70 can induce an inflammatory response, as indicated by the medically 
relevant mouse cell line model. 

5 

EXAMPLE 4: Diagnostic Application 

In this example, the peptides of the invention are used for diagnosis of patients 
with Autoimmune Diseases. Specifically, a panel of the selected/identified HSP-derived 
peptides are used to screen patient PBMC-derived Monocytes, Macrophages and 

1 0 Monocyte derived-dendritic cells as well as other immune cells expressing TLR Such 
screening can be used to identify specific TLRs that can be then "targets" for anti- 
inflammatory therapy. Such TLRs can be disease specific. This then provides for 
autoimmune specific/tailored immune intervention based on the inhibition of innate 
immune mediated proinflammatory pathway activation. 

1 5 Natural as well as synthetic antigen-specific therapeutics derived from TLR 

binder peptides can be then used to interfere/inhibit the binding of HSP and their 
derivatives to TLRs and therefore inhibiting or interfering with the resulting triggered 
inflammatory response in a disease-specific fashion. 

20 EXAMPLE 5: Therapeutic Application: 

Therapeutic applications comprise three general medical areas including cancers, 
infectious diseases and immunodeficiencies and in each of these areas, the HSP-derived 
peptide sequences of the invention may be identified for interacting with specific TLR 
allowing additionally the capability of designing peptides or peptidomimetics which bind 

25 directly to TLR and induce a pro-inflammatory response. Clinical applications range 
from Cancer (systemic or local) to infectious diseases (including granuloma) and 
immunodeficiencies. In another aspect, clinical applications include use of the peptides 
as adjuvants in vaccine formulations against a whole range of infectious disease and 
cancer. In yet another aspect, the peptides of the invention can be applied to autoimmune 

30 diseases, for example, the HSP-derived peptide sequences can be identified and designed 
or peptidomimetics created for interacting with or binding directly to specific TLR to 
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avoid inducing proinflammatory responses. In this instance, the invention peptides are 
contemplated to act in the form of, for example, molecular competitor/inhibitor of natural 
HSP, or triggers of DC-2-type responses (tolerogenic) by acting on and triggering 
different activation pathways. Further, in this example, identified HSP-derived peptide 
5 sequences interacting with specific TLR can be used for the development/identification 
of natural as well as synthetic antigen-specific therapeutics able to interfere/inhibit the 
binding of HSP and their derivatives to TLRs and therefore inhibiting or interfering with 
the resulting triggered inflammatory response. 

Inhibition or interference with the inflammatory response include the inhibition of 
10 all or part of the proinflammatory mediators such as TNF-a or IL-6 and of the Thl- 
promoting cytokines IL-12 and IL-15 and possibly other cytokines and mediators 
secreted as a results of interaction between both bacterial and human HSPs and TLR 
expressed on Monocytes, Macrophages and Monocyte-derived Dendritic cells as well as 
other immune cells expressing TLR. 
1 5 Natural as well as synthetic antigen-specific therapeutics derived from the HSP- 

derived peptides able to interfer/inhibit the binding of HSP and their derivatives to TLRs 
can be preferably admnristered. Further, peptido-mimetics synthetic molecules able to 
block specific TLR involved in the induction of proinflammatory cytokine secretion can 
be designed. Further still, immunization with TLR identified in the screening assay can 
20 be accomplished to bring about lowering of inflammatory responses. 
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ABSTRACT 

Toll receptor-binding peptides are disclosed. Also disclosed are methods of using 
5 such peptides in modulating immune responses in immune-mediated diseases. Further 
disclosed are in silico methodologies for identifying TLR-binding peptide motifs. 
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Chou Fasman 2° structure: Window - 7 
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Figure 13 
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Figure 14 

TNF-a production of Dendritic Cells measured with Real Time PCR 




24 hours after stimulation 



Figure 15A (top) and B(bottbm) 
Percentage of CD8~a positive cells and TNF-a production by Dendritic cells 

(CD83+ HLA-DR+) measured with staining 




24 hours after stimulation 
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